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SUMMARY 
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--^'A  class  of  simple  and  robust  estimators  of  the  difference  between  location 
parameters  of  correlated  variables  is  proposed  when  some  observations  on  either 
of  the  variables  are  missing.  We  show  that  these  estimators  are  consistent, 
asymptotically  normally  distributed,  and  insensitive  to  outlying  observations. 
Asymptotic  relative  efficiency  comparisons  with  other  known  estimators  are  made 


to  show  the  advantage  of  the  proposed  estimators 


~  >  f 


Sone  keywords:  Asymptotic  relative  efficiency;  bivariate  exponential 

distribution;  consistent;  Hodges-Lehmann  estimator;  median. 
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1.  INTRODUCTION 


The  problem  of  estimating  the  difference  between  means  of  a  bivariate 
normal  distribution  when  some  observations  on  either  of  the  variables  are 
missing  has  received  a  great  deal  of  attention  in  recent  statistical  literature 
(Wilks,  1932;  Anderson,  1957;  Hocking  and  Smith,  1968;  Mehta  and  Gurland,  1969; 
Lin,  1971,  1973;  Lin  and  Stivers,  1974).  In  this  article  we  study  the  problem 
of  estimation  of  the  difference  between  the  location  parameters  of  correlated 
variables  from  fragmentary  samples  when  the  population  being  sampled  is  not 
necessarily  normal.  More  specifically,  let  (X,  Y-e)1  be  a  random  vector  with 
absolutely  continuous  joint  distribution  function  H  which  is  free  of  0  and  is 
symmetric  in  its  arguments,  i.e.  H(u,v)  =  H(v,u),  for  (u,v)'eR2.  Also,  let 
(X.j.Y^)',  i=l,...,n  be  n  pairs  of  observations  on  (X ,  Y ) ' ;  Xp+j  ,  j=l,...,s, 

be  s  additional  observations  on  X;  Y^,  k=l,...,t,  be  t  additional 
observations  on  Y.  The  (X^Y^)',  Xp+j ,  and  Yn+k  are  assumed  to  be  mutually 

independent  for  i=l,...,n,  j=l,...,s  and  k=l,...,t.  The  problem  is  how  to  use 
the  fragmentary  sample  in  the  most  efficient  way  to  estimate  the  shift  parameter 
e.  Gupta  and  Rohatgi  (1981)  considered  the  case  that  X  and  Y  are  linearly 
related  and  constructed  regression  estimators  which  are  linear  combinations  of 
fragmentary  sample  means.  Therefore,  their  estimators  are  sensitive  to  outlying 
observati ons. 

A  class  of  simple  and  robust  estimators  6  of  6  is  proposed  in  Section  2. 

We  show  that  these  estimators  are  unbiased  if  the  underlying  distribution  H  is 
symmetric  about  some  point  or  the  two  fragmentary  sample  sizes  are 

A 

equal,  i.e.,  s=t.  Also,  it  is  shown  that  e  is  consistent  and  asymptotically 
normal ly  distributed.  In  Section  3  we  compare  the  asymptotic  relative 
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efficiency  of  9  with  other  known  estimators. 

2.  THE  ESTIMATOR  6 

For  the  fragmentary  sample  given  in  Section  1,  we  take  all  possible 
differences  Y j -X-j ,  i  =1 , . . .  ,n+s,j=l, . . .  ,n+t ,  each  with  multiplicity  a.  We 
also  take  all  possible  Walsh  differences  ^(Y^+Y^  t-X^  -X^ . ) ,  Hik.i'^n,  each 

with  multiplicity  8.  Denote  the  ordered  set  of  the  above  M=o(n+s) (n+t)+en(n+l)/2 
differences  by  where  a  and  8  are  nonnegative  integers.  A 

natural  estimator  6  of  9  i s  then  the  median  of  the  D's,  i.e., 

„  „  f  ii(0(i)'hD(i+i) )»  a(n+s)  (n+t)+en(n+l)/2=2«, 

0  =0(« ,6)  =  { 

l  D(A+i),  a(n+s)  (n+t)+Bn(n+l)/2=2Jt+l. 

Mote  that  the  estimator©  (0,1)  which  disregards  the  information  from  the 
incomplete  pairs  is  a  Hodges-Lehmann  estimator  of  9  based  on  Wilcoxon  signed 
rank  statistic.  The  estimator  0(1,0)  which  uses  all  the  data  points  but  ignores 
the  pairing  information  is  based  on  the  two-sample  Mann-Whitney-Wi Icoxon 
stati Stic. 

First,  it  is  proved  in  the  Appendix  that  the  distribution  of  the  difference 
9(a,8)-0  is  free  of  9  and  the  estimator  0(a,8)  is  distributed  symmetrically 
about  e  if  either  of  the  following  two  conditions  hold:  (a)  the  distribution  H 
is  symmetric  about  some  point  (jq,  u2 ) ’ *  (b)  the  two  fragmentary  sample 

sizes  are  equal,  i.e.,  s=t.  Thus  under  the  stated  conditions,  the  estimator  9 
is  unbiased.  Now,  let  us  consider  the  asymptotic  performance  of  e. 

Theorem  1.  Let  the  distribution  functions  of  Xj  and  Y^-e-X^  be  denoted  by 
F  and  G.  The  corresponding  density  functions  f  and  g  of  F  and  G  are  assumed  to 


-v, 
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satisfy  the  mild  conditions  /f^(u)du<®  and  /g2(u)du<®,  respecti vely.  Also, 
let  N=2n+s+t  and  A1.A2.A3  be  nonegative  numbers  such  that  Ai+A3>0, 

A2+A3>0  and  +X2+2X3=1  •  Then,  as  N-*«,  s/N-^Xj,  t/N-*A2,  n/N-*\3,  the 

distribution  of  0(a,8)-6)  converges  to  a  normal  distribution  with  mean  0  and 
2 

variance  o  (0)  = 

[a2 {l/12+A3(l/2-/F(u)F(v)dH(u,v))}+82A^/{12(A3+A1)(A3+A1)}+ 
aBA§{l/2-2/F(u)G(v-u)dH(u,v)}/{(A3+A1)(A3+A2)}]/[a(A3+A1)i'2(A3+A2)!i 

/f2(u)du+BA2  /g2(u)du/{ ( A3+Aj )( A3+A2) }]. 

The  above  theorem  also  shows  that  0  (a, 8)  is  consistent.  We  note  that  the 
computation  of  0  requires  finding  the  median  of  the  a(n+s) (n+t)+Jsen(n+l) 
differences  and  becomes  rather  tedious  for  a  large  set  of  data.  Fortunately, 
there  are  several  shortcut  methods  of  obtaining  0  (Lehmann,  1975). 

3.  THE  EFFICIENCY  OF  0 


The  estimator  0  is  compared  with  the  estimator  0]_  (Lin  and  Stivers,  1974, 
(2.4)),  a  regression  estimator  ©3  (Gupta  and  Rohatgi,  1979,  (9))  and  a  naive 

n+t  n+s 

estimator  0..=?-5(,  where  7=  I  Y./(n+t)  and  K=  I  X./(n+s).  All  the  estimators 

.  1  I  .  -  1 

i=l  i=l 

mentioned  above  are  consistent  and  asymptotically  normally  distributed. 

Let  9  and  9*  be  asymptotically  unbiased  estimators  for  a  parameter  0  in  the 
sense  that  both  N^e-e)  and  N^e'-o)  have  asymptotic  distributions  with  zero 
means.  The  asymptotic  relative  efficiency  of  0  with  respect  to  8',  denoted  by 
ARE (0,9'),  is  defined  by 


K. 
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ARE  (0,0')  =  o2(el  )  , 
o2(6) 

where  a2(e)  =  1 i m  var  (N^e)  and  a2(e')  =  lim  var  (N^e‘)  (c.f.  Randles  and 
N  -*■-  N  -*» 

Wolfe  (1979),  p.  227). 

Two  bivariate  distributions  were  considered  for  H  in  the  compari son  study: 

(a)  A  bivariate  normal  distribution  with  unit  variance  and  correlation 
coefficient  p. 

(b)  A  bivariate  exponential  distribution  (Gumbel  (I960)).  The  joint 
distribution  function  is 

H(u,v )=  F (u)F(v)  [l+t{ 1-F(u)}{ 1-F( v)} ], 

where  F(u)=l-e“u,  -1<t<1  and  p=x/4. 

The  advantage  of  using  Gumbel's  bivariate  exponential  distribution  for  our 
comparison  study  is  that  the  asymptotic  variance  a2(e)  of  0  in  Theorem  1  has  a 
closed  form.  For  each  family  of  bivariate  distributions  mentioned  above  and  a 
group  of  selected  values  of  p  and  t,  the  asymptotic  relative  efficiencies 
e-j  (a,B)=ARE(0(a,6)  ,0-j ) ,  i=l,2,3,  were  computed.  Table  1  provides  the 
computational  results  for  (A):  x^=0.2,  X2=.2,  X3=0.3  and  (B):  x^O.05, 

X2=.35,  X3=.3  under  the  normal  distribution.  The  estimator  0  (1,0)  performs 

as  well  as  e-j  (i  =1,2,3)  except  for  large  values  of  p.  The  estimator  0(1,1)  is 
more  efficient  than  e  (1,0)  for  positive  p.  This  is  not  surprising  because 
6(1,0)  ignores  the  pairing  information.  However,  if  the  ratio  B/o  is  too  large, 
the  efficiency  of  the  estimator  e(a,B)  becomes  rather  low  except  for  extremely 
large  p. 

Results  similar  to  Table  1  were  also  found  for  other  combinations  of  X^ , 
i =1 ,2,3.  The  choice  of  a=l  and  0=1  generally  yields  a  good  estimator  which  can 
be  applied  for  a  wide  range  of  values  of  p  without  requiring  the  user  to  specify 
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or  estimate  the  correlation  coefficient  p. 

The  estimator  9  does  much  better  than  e-j ,  i=l,2,3,  for  heavy  tail 
distributions  as  is  to  be  expected  because  ©-j  (i  =1,2,3)  is  a  linear  combination 
of  fragmentary  sample  means  (See  Table  2).  In  particular,  if  H  is  a  bivariate 
Cauchy  distribution  (Johnson  and  Kotz,  1976,  p.  295)  which  is  not  shown  in  our 
tables,  the  ARE  (  6,  9i  )=®,i=l,2,3. 


4.  REMARKS 

An  interesting  feature  of  9(1,0)  is  that  it  can  be  used  even  when  the 
pairing  of  with  Yj,  i=  l,...,n,  cannot  be  identified  (c.f.  Hollander, 

Pledger,  and  Lin  (1974)).  For  example,  a  statewide  readiness  test  was  given  at 
the  beginning  of  the  1979-80  school  year  to  every  incoming  first  grade  public 
school  student  of  South  Carolina.  The  purpose  of  this  test  was  to  distinguish 
those  students  who  were  ready  for  the  formal  first  grade  curriculum  from  those 
who  were  not  ready.  A  pilot  testing  was  conducted  to  obtain  the  cutoff  score 
using  a  random  sample  of  South  Carolina's  kindergarten  students  at  the  end  of 
the  1978-79  school  year.  Educators  have  constantly  demonstrated  that  in  the 
very  early  years  of  schooling  a  vast  amount  of  a  student's  achievement  is  caused 
by  maturation  and  not  necessarily  by  instruction.  This  coupled  with  the  fact 
that  these  two  tests  were  conducted  approximately  four  months  apart  establishes 
a  concern  as  to  how  much  the  cutoff  score  previously  determined  by  the  pilot 
test  should  be  moved  upward.  So  the  problem  becomes  estimating  "maturational 
growth"  occurred  during  the  summer  months.  However,  many  of  the  students  in  the 
pilot  test  cannot  be  identified  at  the  data  analysis  time  due  to  various  human 
factors.  Therefore,  all  the  parametric  and  regression  estimation  procedures 
mentioned  before  are  not  valid.  A  similar  example  was  also  cited  by  Hollander, 
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Pledger  and  Lin  (1974). 

Theorem  1  can  be  used  to  construct  a  test  of  e=e0,  for  any  eQ.  By 
looking  at  the  acceptance  regions  of  such  tests,  one  could— in  theory— construct 
a  confidence  interval  of  8  based  on  the  fragmentary  samples. 

5.  APPENDIX 


Proof  of  small  sample  properties  of  8.  The  proof  that  the  distribution  of  the 
difference  0(a,6)-e  is  free  of  6  is  strai ghtforward.  The  proof  of  the  symmetry 
of  the  distribution  of  3(a,s)  about  9  is  similar  to  that  given  by  Lehmann,  1975, 
Theorem  3,  p.  86. 

Two  lemmas  are  needed  to  prove  Theorem  K  First,  let  us  define  a  scoring 
function  <t>  for  comparing  two  observations  and  Y j  by 

r  l,  xi<Yi, 

<<Xi’Yj)H 

<■  0,  otherwise. 


and 


n+s  n+t  ,  n  n 

Wx>y  =a  l  l  tfX^YjO+B  l  l  ♦(Xi+Xi',  Y1+Y1'). 

i =1  j-1  i-1  i ' =1 

Lemma  1.  For  any  real  number  c  and  integer  i  between  1  and  a(n+s)(n+t)+ 
^0n(n+l),  the  ith  ordered  difference  D(-j)i.c  If  and  only  if 

^X, Y-c— a(n+s)  ^n+t)+  ^n(n+l  )-i . 

Proof.  C.f.  Theorem  4  of  Chapter  2,  p.  87,  of  Lehmann,  1975. 

Lemma  2.  For  9=0  and  a  positive  real  c,  the  distribution  of 

((n+s)(n+t)N)_Ji{wx>Y.c/v^'  -a(n+s)(n+t)p1-Bn(n+l)p2/2} 

converges  to  a  normal  distribution  with  mean  0  and  variance 
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o  -a  [l/12+\3( l/2-2/F(u)F ( v)dH(u,v )} ]+$  X3/{ 12( X3+X1 ) ( X3+X2)>  + 

oBX§{l/2-2/F(u)6(v-u)dH(u,v)}/{(X3+X1)(X3+X2) ), 

as  N-*-  ”,  s/N-i-Xj,  t/N-^2,  n/N-*A3,  where  p3=  /F(u+c/ vfT)dF ( u) ,  P(.  )=1-F(. ) 
and  P2=/G(u-2c/ vfT)dG(u) . 

Proof.  For  convenience,  let  us  define  a  sequence  of  fragmentary  samples 
(Xl,m’  Y1  ,m) ' » *  *  * ’  (Xnm,m>  Ynm,m)l;  Xnm+l,rn . Xnm+sm,m;  Ynm+l,rn  ’**•’ 


m  m  » . m 


Yn  m,  where  (X^  m,  m+c/ 4^) '  has  distribution  function  H  (i=l,...,nm, 

Nm=2nm+s(n+tm) ,  Xi  m  and  Y  j  m  have  distribution  functions  F(x)  and  F(y+ 
c//Tm),  i  =1 , . . .  ,nm+  »  ,  j»l,...,nm+tm,  respectively,  and  x3,  tm/Nm-X2, 


n  /N  - 
m  m 


X  ,  as  m->  «■.  Furthermore,  we  assume  that  (X  ,Y  )',  Xn  m 
3  i  ,m  i  ,m  nm  J  *  ' 


Yn  +k  m  are  mutually  independent  for  i=l,...,  nm,  j  =  l,...,sm,  k=l,...,tm  and 
(Xl,m,Yl,m)  ,  =  (U,v-c/yftm) '  ,  ni>l,  where  (U,V)'  has  distribution  function  H. 

Now,  let  p1>m  =/F(u+c//Rm)dF(u)  p2>m  =/G(u-2c//Rm)dG(u) , 
i  N  +s  N  +t 

''Jm=^nm+sm^nm+tm^  V  (a  I  1  ^xi  ,m,Yj,m^"pl,m^  + 

y  i =1  j-1 

n  n 

6  ^xi  ,m+Xi 1  ,m’  Yi  ,m+Yi  ‘  .m-^Pz.mj*  anc* 


=  a|(n  +t  )N  '  ^  (^Xi  ,m+c/,/Trm)_pl,m}  +  a 

m  ‘'m'  nm  J 


h  nm+si 


nm~t~sm 

("m+tm>Nm 


h  nm+tr 


^  {F ^ Y J  ~pl ' 

j-1 


^  ... 
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Then,  it  follows  from  the  argument  provided  by  Hollander,  Pledger  and  Lin  (1974, 

p.  179)  that  U*  and  have  the  same  limiting  distribution. 

*  / 

The  asymptotic  normality  of  Tm  follows  from  a  version  of  Berry-Esseen 

Theorem  (Chung,  1968,  Theorem  7.1.2.,  p.  185). 

Proof  of  Theorem  1.  For  any  real  c,  P.  (N 2( e(ot,0)-e)<c )  = 

Pg(iN^0(a,0)<c)=Po(e(a,B)<c/v4i:).  Without  loss  of  generality,  we  can  assume 

that  e=C.  Consider  first  the  case  a(n+s)  (n+t)+J56n(n+l)  =  2jJ+l.  By  Lemmas  1  and  2 

the  fact  that  /f^(u)du<=>  and  /g^(u)du<®  (Olshen,  1967,  and  Mehra  and 

Sarangi ,  1967) , 


P0(D(,+1)<c/^).P0(UXj¥.c//jf  <  ^tslinjj^nln.11.^1) 


5  P, 


{(ntsHn+tJNj’15  {“x,r-c//S-“(nts)(n+t>PrBn(n+1>P2/2j< 


(i, <+->)( t»t>) 


N 


oN 


BA2  N1* 


(1/2  -/  F(u+c//!T)dF (u))+  2(x~+x~)(x3+Tz)^/Z  -  J  G(u-2c//R)dG(u)  } 


•— *  ^c|a(A3+A1)'2(A3+A2)'i/f2(u)du  +  BA32/g2(  u)du/ ( ( A3+Aj )  ( A3+A2)  )^J/n 
as  N  -+~,  s/N-*-A^,  t/N-*-A2.  n/N-(-A3,  where  *  is  the  distribution  function  of 


,  (A-l) 


N(0,1). 

In  the  case  a(n+s) (n+t)+^0n(n+l)=2£,  the  probability  Pg( e^c/Zfj")  is  bounded 
below  and  above  by  Pq(D^+1) <c/*4T)  and  Pg(D^^<c/vfT).  By  the  same 

argument,  it  can  be  shown  that  these  two  probabi 1 ities  have  the  same  limiting  value 
(A-l). 


r<r 


TABLE  1.  THE  ASYMPTOTIC  RELATIVE  EFFICIENCY  ej(a,0)  X  100  UNDER 
BIVARIATE  NORMAL  DISTRIBUTION 

(A) :  A2=0.2,  \2=0.2,  X3=0.3 

(B) :  Xj=0.05,  X2=0*35,  X3=0.3 


Correl ation 

coefficient 

ej(l 

.0) 

,0) 

e3(l,0) 

e^l.l) 

e  2  ( 1 

,1) 

e3d.l> 

p 

(A) 

(B) 

(A) 

(B) 

(A) 

(B) 

(A) 

(B) 

(A) 

(B) 

(A) 

(B) 

-0.8 

89 

84 

89 

84 

96 

96 

82 

77 

82 

77 

89 

88 

-0.6 

91 

89 

93 

89 

96 

96 

85 

82 

86 

83 

90 

89 

-0.4 

94 

92 

96 

94 

96 

96 

88 

87 

91 

88 

91 

90 

-0.2 

95 

95 

99 

97 

96 

96 

91 

91 

95 

93 

92 

92 

0.0 

96 

96 

101 

98 

96 

96 

94 

94 

99 

97 

94 

94 

0.2 

94 

94 

100 

97 

95 

95 

96 

96 

101 

99 

97 

97 

0.4 

89 

88 

94 

90 

94 

94 

96 

95 

102 

98 

102 

103 

0.6 

77 

75 

82 

77 

94 

94 

93 

91 

98 

94 

113 

115 

0.8 

53 

50 

55 

51 

94 

94 

81 

79 

84 

80 

143 

148 

TABLE  2. 

THE  ASYMPTOTIC 

RELATIVE 

EFFIC 

IENCY 

ej  (a 

,3)  X100 

UNDER 

BIVARIATE 

EXPONENTIAL  DISTRIBUTION 


(A) :  x^-0.2,  X2=0. 2,  X-j-0.3 

(B) ;  X^  =0. 05,  X2~0. 35,  X3=0.3 


X 

el 

(1,0) 

e2(l 

,0) 

e3U 

,0) 

e!  ( 1 

,D 

e2(  1 

,D 

e3(M) 

(A) 

(B) 

(A) 

(B) 

(A) 

(B) 

(A) 

(B) 

(A) 

(B) 

(A) 

(B) 

-0.8 

287 

286 

299 

293 

290 

290 

243 

240 

254 

246 

245 

243 

-0.6 

291 

290 

304 

297 

292 

292 

248 

245 

259 

252 

249 

247 

-0.4 

294 

293 

308 

300 

294 

294 

253 

251 

265 

257 

253 

251 

-0.2 

297 

297 

312 

305 

297 

297 

258 

256 

271 

263 

258 

256 

0.0 

300 

300 

316 

309 

300 

300 

263 

262 

277 

269 

263 

262 

0.2 

303 

303 

320 

312 

303 

303 

269 

267 

284 

275 

269 

267 
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